Journal: Nature Communications
Article Title: Leveraging data-driven self-consistency for high-fidelity gene expression recovery
doi: 10.1038/s41467-022-34595-w
Figure Lengend Snippet: The histograms of the reference data, observed data (1% sampling efficiency), and imputed data by MAGIC, mcImpute, and SERM are shown in the first row of ( a ). Visualization of reference, observed, and imputed data by t-SNE and UMAP are shown in the second and third rows, respectively. t-SNE and UMAP results from SERM imputed data are much better in separating the classes, whereas MAGIC degrades the data due to imputation. The clustering accuracy and cluster quality indices for UMAP visualizations of imputed data from different methods are shown in ( b ). Data are presented as mean values +/− standard deviation (SD). Error bars represent the standard deviation of the indices for n = 1000 different initializations of k-means clustering. Source data are provided as a Source Data file.
Article Snippet: Other distributions can also be included in SERM (see Python/Matlab codes of SERM).
Techniques: Sampling, Standard Deviation